notebook.community

Good exercises

http://www.wildml.com/2016/10/learning-reinforcement-learning/

Explanation

https://mpatacchiola.github.io/blog/2016/12/09/dissecting-reinforcement-learning.html

Textbook

http://ufal.mff.cuni.cz/~straka/courses/npfl114/2016/sutton-bookdraft2016sep.pdf

  • Multi-armed bandit
  • Q-learning
  • SARSA
  • TD (temporal-difference) learning

Q-learning vs. SARSA: https://studywolf.wordpress.com/2013/07/01/reinforcement-learning-sarsa-vs-q-learning/
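
As a rough illustration of the difference between the two, here is a minimal sketch of the tabular update rules. It is not taken from the linked post. The function names, the toy Q-table, and the values of `alpha` (step size) and `gamma` (discount) are all assumptions chosen for illustration. Q-learning bootstraps from the best action in the next state, while SARSA bootstraps from the action that was actually taken there.

```python
def q_learning_update(Q, s, a, r, s_next, alpha=0.5, gamma=0.9):
    """Off-policy update: the target uses the max action value in s_next."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.5, gamma=0.9):
    """On-policy update: the target uses the action actually chosen in s_next."""
    target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])


# Hypothetical toy Q-tables: 2 states x 2 actions, identical starting values.
Q_qlearn = [[0.0, 0.0], [1.0, 2.0]]
Q_sarsa = [[0.0, 0.0], [1.0, 2.0]]

# Same transition (s=0, a=0, r=1.0, s_next=1) for both methods.
q_learning_update(Q_qlearn, s=0, a=0, r=1.0, s_next=1)
# SARSA also needs the next action; here the agent happened to pick action 0.
sarsa_update(Q_sarsa, s=0, a=0, r=1.0, s_next=1, a_next=0)

print(Q_qlearn[0][0], Q_sarsa[0][0])
```

When the next action is an exploratory (non-greedy) one, as in this toy transition, the two updates produce different values. When the next action is the greedy one, they coincide.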


Content source: tjwei/HackNTU_Data_2017
